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1. WinPSK Overview 



1.1. Introduction 

PSK31 is an amateur radio communications mode introduced by Peter Martinez, G3PLX , that 
uses phase modulation and special character coding. It provides robust narrow bandwidth keyboard Chat 
type communications between two or more stations. 

This document was written to describe some of the internal workings of the WinPSK program that 
was developed as a result of my experimenting with DSP on a PC soundcard. Previously, experimenting 
with DSP was achieved using evaluation boards from various DSP chip manufacturers. Programming 
these boards was tedious due to their assembly language and fixed-point number representation. Trying 
to learn the basics of DSP often got lost in the details of programming and debugging these specialty 
processors. 

Beginning with the Intel 486 and subsequent Pentium class processors being used in the popular 
desktop PC platform, the processing power has increased to the point where real time signal processing 
can now be done using floating point arithmetic and a PC soundcard for analog I/O. The amateur radio 
community has benefited from these advances with PC Soundcard based applications for SSTV, RTTY, 
and more recently, PSK31 . 

My interest in all this was in learning how to develop and program various DSP communications 
algorithms using a standard Windows 1 based PC platform. It is from these experiments that WinPSK 
evolved from basically a DSP test bed to a simple functioning program for PSK31 . This paper describes 
in some detail the basic design decisions that were made during this learning process that led to the final 
program. It is not meant as a definitive reference on PSK31 implementation but just an engineering 
notebook describing this program. Perhaps others can build on some of the information here to improve 
this program as well as be motivated to experiment with new modes. 

The basic goal was to write a working PSK31 interface program from scratch. Unlike some other 
HF modes, Peter Martinez has made available very complete specifications for this mode 2 , 3 . Also his 
Windows program "PSKsbw" provided an excellent reference program for verifying and testing various 
algorithms. 

"WinPSK User Guide" is a separate document that describes the user operation of the program. 
This document only describes the inner workings of the program. 

The program will be described in sections starting with signal generation then followed by the 
reception algorithms. Finally, the overall software architecture and miscellaneous issues will be 
discussed. 
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2. WinPSK Signal Generation 

Creating a PSK31 signal is done in stages starting with character input to final waveform output to the 
soundcard. r 

2.1. Block Diagram 




2.2. Input Characters 

oliT SB !l dS r? T^f S 8 b l Ch ^ e,S - ° thrOUgh 127 are the standard ASC » characters and 128 to 
255 are extended characters. WinPSK ignores most control codes below the SP(/Ox30) character to 
reduce some of the garbage from making it to the screen. Some of the Windows controls also behave 
badly when a control character is sent to them. oenave 

2.3. Varicode Encoding 

The first step in PSK31 encoding is to map the 8 bit fixed length input characters into variable lenqth 
characters. By mapping most used characters into shorter codes and least used characters into lonqer 
codes, the overall data transfer speed can be increased. This is similar to Morse code where common 
letters are shorter sequences. The letter 'e' occurs more often in text than a '? so it has a varicode of '1 1* 
while a z has a code of '1 1 1010101'. Notice that lowercase letters have shorter codes than upper case 
letters. This is why one should not use all uppercase when using PSK31 since the varicode was 
optimized for lowercase letters. ~ 

Since the character data is sent serially, some means of separating characters is also needed. This is 
accomplished in PSK31 by specifying that two or more consecutive zero bits separate each character 
This also places the requirement that each character code cannot contain more than one consecutive" 
zero. It also means each code must start and end with a one. With these requirements the Varicode code 
table was specified. The vancode words from the table are sent msb first. If a new character is not readv 
in time to be sent. Zeros are padded into the data stream. 



Example bit stream of varicoded character sequence "abc": 
.. .00101 10 0101 1111 00 1011 11 00 
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Varicode 


















Output 




Output 






1010101011 




1010111101 




110111101 


192 


11011101111 




1011011011 




1111101 






193 


11011110101 




1011101101 


B' 


11101011 




111010101 


194 


11011110111 


ETX 


1101110111 


•c 


10101101 




111010111 


195 


11011111011 




1011101011 


'D' 


10110101 


32 


111011011 


196 


11011111101 


ENQ 


1101011111 


'E' 


1110111 




111011101 


197 






1011101111 


'F 


11011011 




111011111 


198 


11101010101 




1011111101 


'G' 


11111101 


135 






11101010111 




1011111111 




101010101 


136 


1111101101 


200 


11101011011 








1111111 


137 


1111101111 


201 


11101011101 








111111101 


138 


1111110101 


202 






1101101111 




101111101 




1111110111 


203 


11101101011 




1011011101 


V 


11010111 


140 


1111111011 


204 


11101101101 






w 


10111011 




1111111101 


205 






1101110101 




11011101 


142 


1111111111 


206 


11101110101 




1110101011 




10101011 


143 


10101010101 


207 


11101110111 




1011110111 




11010101 


144 


10101010111 


208 


11101111011 




1011110101 




111011101 


145 


10101011011 








1110101101 


'R' 




146 


10101011101 








1110101111 


,g, 


1101111 


147 




211 


11110101011 




1101011011 


T 






10101101011 


212 


11110101101 




1101101011 






149 


10101101101 


213 






1101101101 






150 _^ 


10101101111 




11110110101 




1101010111 






151 




215 


11110110111 




1101111011 


X 


101110101 


152 


10101110111 


216 


11110111011 




1101111101 




101111011 




10101111011 


217 


11110111101 




1110110111 


T 


1010101101 


154 


10101111101 


218 


11110111111 




1101010101 






155 




219 


11111010101 




1101011101 


•v 




156 


10110101011 


220 


11111010111 




1110111011 


T J 


111111011 


157 


10110101101 


221 


11111011011 




1011111011 




1010111111 


158 


10110101111 


222 






1101111111 




101101101 


159 


10110110101 


223 


11111011111 










160 


10110110111 


224 


11111101011 








1011 


161 


10110111011 








101011111 




1011111 


162 


10110111101 


226 


11111101111 




111110101 


'C 


101111 


163 




227 


11111110101 




111011011 




101101 


164 


10111010101 








1011010101 








10111010111 


229 








1010111011 


•f 


111101 


166 


10111011011 


230 


11111 


11101 




101111111 




1011011 


167 


10111011101 


231 


11111 


11111 








101011 


168 


10111011111 


232 


10101010101 










1101 


169 


10111101011 


233 


10101010110 






101101111 


T 


111101011 


170 


10111101101 


234 


10101010111 








V 


10111111 


171 


10111101111 


235 


10101011010 






1110101 




11011 


172 


10111110101 


236 


10101011011 






110101 




111011 


173 


10111110111 


237 


10101011101 






1010111 








10111111011 


238 


10101011110 




r 


110101111 




111 


175 


10111111101 


239 


10101011111 




'0' 


10110111 


V 


111111 


176 


10111111111 


240 


10101101010 






10111101 


■q' 


110111111 


177 


11010101011 


241 


10101101011 




'2' 


11101101 


r 


10101 


178 


11010101101 


242 






'3' 




's' 


10111 


179 


11010101111 


243 


10101101110 








r 


101 


180 


11010110101 


244 


10101101111 




'5' 


101011011 






181 


11010110111 




10101110101 




•6' 






1111011 


182 


11010111011 








7" 


110101101 


w 


1101011 


183 


11010111101 


247 


10101110111 




'8' 


110101011 


x 1 


11011111 








10101111010 




■9' 


110110111 


V 


1011101 


185 


11011010101 


249 


10101111011 






11110101 


'2' 


111010101 




11011010111 


250 


10101111101 






110111101 


T 


1010110111 




11011011011 


251 


10101111110 






111101101 


T 


110111011 


188 


11011011101 


252 


10101111111 






1010101 


T 


1010110101 




11011011111 


253 


10110101010 






111010111 




1011010111 




11011101011 


254 


10110101011 




T 




DEL 


1110110101 


191 


11011101101 


255 


10110101101 
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2.4. BPSK Serialization 

signal phase does not na,e to be known. For SSS^^I"' "T^""" 

BPSK "Zero" Data Bit BPSK "One" Data Bit 
(symbol='ip') (symboNW) 



180 Deg. Change 



No Change 



Symbol 
•00' 
■10 1 



Bit 

1 No Change 
0 180 deg. Changi 



Varicode Word 

.nIaLLLX X 
<-|_Shift RegisterJ 
ige — 1 



2.5. QPSK Serialization 

information to allow for error correction. 3St ' PSK31 uses the extra 



QPSK -1 0- symbol QPSK W symbol QPSK '01 • symbol QPSK "1 1 • symbol 




180 deg. change 



no change 



ft 



+90 deg. shift -90 deg. shift 



2.5.1. ECC Encoding Method 



number of bits used to spread the redundancy. 
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Logically, a shift register is used to shift in each data bit. By exclusive OR'ing certain bits together, the 
desired symbol encoding is performed. The bit pattems(polynomials) which are used for exclusive OR'ing 
determines how well the system will be able to correct errors. The two polynomials used in PSK31 are: 



G0(x) = x* + x 2 +x' + x" 

The following diagram shows how the polynomials are used to generate a two bit symbol for every input 




Symbol ^ s0 



Note that the data bits from the varicode word are inverted before entering the shift register. This is so 
that the idle stream of all zeros will produce symbols of '10' which are 180 degree phase shifts. This is 
useful for maintaining symbol sync on the receiver side and being compatible with BPSK. 

Varicode Word 



Symbol 

00 = NO CHANGE < 
10 =180 CHANGE 

01 = +90 CHANGE <~ 
11= -90 CHANGE 



vLLLLL X 31.25 Hz 
[ shift register <j — symbol 

\|/\l/sj/j/\|x 



Convolution 
Code Look-up 
table 



The QPSK encoder is actually implemented using a look-up table rather than using exclusive OR gates. 



2.6. Differential Phase Shift encoding 

The next step is to take the two bit symbol and convert it into the actual signal phase state. Depending on 
the previous signal phase, there are 4 signal phase possibilities for each new symbol. A simple state 
machine takes the present phase state information and the new symbol to come up with the next signal 
phase state. In WinPSK this is done using state tables. 
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2.7. Wave Shaping and Carrier Generation 

The common way to create angle modulated signals is to combine two sinusoidal waveforms whose 

XT£ LHL ? 1 freqUenCy 3nd th3t are 90 degrees out of P hase fra ™ each other By 
addmg these two signals in different proportions, a signal of any desired phase can be created The two 
signals are referred to as l(in phase) and (^quadrature phase). meiwo 



sin(wct) 



cos (wet) 

The following MathCad 4 simulation shows how a BPSK signal could be created using the UQ method. 

carrier amplitude A := — 

Symbol frequency Fs := 31.25 
carrier frequency Fc := 150 

Carrier equations 
Ic(t) := A sin(2-7i t Fc) Qc(t) := Acos(2-Jut-Fc) 



Modulation equations 
I(t) := i?(t<.032),l,-l) Q( t ):= i?(t<.032), 1 1) 



This is a 180 degree phase shift followed by No phase shift. 

BPSKVectore 
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bpsk(t):= l(t)-[c(t)*Q(t)-Qc(t) 
BPSK. Signal 




0.02 0.04 0.06 0.08 



Note the abrupt phase change at time t = .032 seconds. This is not desirable since it makes the PSK 
signal very wide. One way to limit the bandwidth would be to filter the output signal. PSK31 uses a 
different method by using waveshaping on the I and Q input signals so that instead of abruptly going from 
a 1 to a -1 , the signal makes a cosine shaped transition between -1 and 1 . 



Here is the same MathCad simulation except that the I and Q modulation signals are no longer 
rectangular, but are cosine shaped. 

Fc := 400 carrier frequency A := — carrier amplitude 

Fs:=3l.25 Symbol frequency t - 0,. 000032 ..0.099 Plot range 
carrier equations 

Ic(t) := A sir<2-7t t Fc) Qc(t) := Acos(2-7tt-Fc) 

modulation equations 
I(t):=if((t<.048),sir(n Fs t),-l) Q(t) := i?(t<.048),sir(7l Fs t),- 1) 




This is a 180 degree phase shift followed by No phase shift. 
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Note he gradual transition from one phase state to the next. This results in a much narrower bandwidth 
signal without the need for any post filtering. Also it can be seen that the amplitude of the signal is not 
constant. This means that the transmitter must not compress or limit the audio waveform otherwise the 
signal will again get much wider in bandwidth. This is perhaps the biggest problem with setting up a 
PSK31 station. It is very easy to overdrive and distort the PSK31 signal by applying the relatively high 
amplitude audio signal from the PC soundcard into the low level microphone input of a SSB transmitter 
There is no easy way to monitor ones own signal for purity so one must rely on other's signal reports 
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Finally here is a MathCad simulation showing a QPSK signal that changes 180 degrees then by -90 
degrees. 

Modulation equations 




0 0.05 O.I 0 0.05 0.1 

This is a 180 degree phase shift followed by a -90 degree phase shift. 



QPSK Vectors 



Q(0 




I(t) 

qpsk(t):=I(t)Ic(t) + Q(t)Qc(t) 
QPSK Signal 




0 0.02 0.04 0.06 0.08 



Note that during 90 degree phase changes, the amplitude does not drop all the way to zero as in the 180 
degree case. 
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2.8. Power Spectrum 

The BPSK/QPSK signal has a power spectrum consisting of a large main lobe centered around the 
earner frequency out to a null at the earner frequency + /- 31 .25 Hz. There are muNipS lobe ertendina 
out to infinity but their amplitudes continue to drop. exienaing 
Here is a MathCad simulation of the PSK31 power spectrum 6 . 



f:=- 140,-139.2.. 139.2 
Sdb(f):= 101og(S(0) + 14.9 



Power Spectrum of PS 
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3. WinPSK Signal Detection 



3.1. Block Diagram 

The following block diagram shows the major functions implemented by WinPSK to receive PSK31 
signals. Receiver audio is captured by the PC soundcard and processed into final ASCII characters for 
display. This schemeis by no means the most optimal method but is the method that evolved from a lot of 
experimentation with various architectures for WinPSK. 
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3.2. Soundcard Input 



ir audio is sampled by the soundcard into 1 6 bit samples at a 1 1 025 Hz rate and converted 
into floating point representation for the remainder of the processing. convened 

3.3. Decimation by 2 

A block bandlimits the signal to 2700 Hz and performs a sample rate conversion down 
to 5512.5 Hz. A 45 Tap FIR decimation fitter is used to perform this task. The following is a freauenrv 
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A 


ft 























IO00 1500 2000 2500 3000 35O0 



4000 4500 5000 



This real signal is fed to the FFT and realtime display section for tuning and visual signal monitoring It is 
also sent on to the next stage of the PSK decoder. 9 ' 

3.4. Complex Mixer 

The next stage converts the real audio input into a complex baseband signal centered around the users 
center frequency set point. An NCO(numerically controlled oscillator) is implemented as a sin(wt) antf! 
cos(wt) frequency source where co is a control input from the users center frequency setpoint and also the 
AFC control signal that is denved further downstream. These two frequencies are 90 decrees apart and 
are multiplied by the real input signal to create two data streams called I and Q. 

/ (0 = fnput(t) cos(fijr) Q( t ) = Input(t) sm(wt) 

The following code segment performs all these functions: 

//Generate complex sample by mixing input sample with NCO's sin/cos 
Inptrl->x = pm[i] * cos ( vcophz ); //generate I and Q signals 
Inptrl->y = pln[i] * sin ( vcophz ); 

vcophz = vcophz + (mjohzinc + freqerror) ; //update NCO 
if (vcophz > PI2) //handle 2 Pi wrap around 

vcophz -= PI2; 
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3.5. Decimation by 9 

The complex signal is then down sampled again by 9 to further reduce the sampling rate to about 20 
times the signal bandwidth. Two stages of decimation by 3 are used rather than one. It is more 
processor efficient to break it up rather than have a fairly long FIR running at the highest sample rate. 
Each stage is identical and has a frequency response that is the same except that the one is scaled 
infrequency by 3. Since the signal is complex, the filters are run on both the I and Q signal. 



27 Tap Decimation by 3 FIRfotago 1) 




0 120 240 360 480 600 720 840 960 1080 1200 



27 Tap Decimation by 3 FIR(stage 2) 
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200 240 280 320 360 
1c 



The final sampling frequency is now 11025/18 or 612.5 Hz. 
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3.6. Matched Data Bit filter 

The final system bandwidth is set by this FIR filter. This filter has two purposes One is to provide a 
magnitude response that provides the best signal to noise ratio in order to extract the da a Sal torn the 
noise. The second thing it must do is minimize any ISI(lnter-Symbol lnterfe re nee)^^^S5S K 
fransmitter. signal path, and receiver system. With PSK31 there is no ISI fmmSSsmSo^cess 
due to the wave shaping of the signal, so any ISI will come from the signal path and the reS tor E 
Since the HF signal path is not predictable the best one can do is minimize the ISI generated by £ 
receiver b, FIR filter. I could find no books or papers on the ideal filter for this mode ofTsK By 
expenmentation ,t appears the ideal filter would a "brickwair low pass filter with a flat maqnrtude resoonse 
out to one-half the bit rate of 31.25 Hz or 15.625 Hz. It must aho have a minimurr tamSTteay 
These requirements seem to be almost mutually exclusive since the more ideal the filter the lonqer the 
delay. A compromise filter was developed that gives fairly low ISI and a reasonable cutoff shape Better 
ISf? P r t ab ' y ° Ut there bu \ W0 T "! d POt Pr ° Vide 3 Wh0le l0t of no,iceab,e PerforrSan^pSmenf 
fhinnL e R T onm f h nt - J he addition of interleaving or longer ECC codes would probably make 

a bigger difference. Below is the frequency response of the bit filter. 
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3.7. Frequency Error filter 

Unfortunately, the AFC(automatic frequency control) block could not use the output of the bit filter for 
locking on to the incoming signal frequency. The problem is that the bit filter is too narrow and the AFC 
can lock onto either side of the PSK31 idle signal which looks like two carriers spaced 15.625 Hz above 
and below the center frequency. The solution though wasteful was to use a separate filter just for the 
frequency control that was wide enough that it cannot distinguish between the PSK31 idle peaks. The 
response is only 6 dB down at 31 .25 Hz so it spans both idle peaks. 

79 tap FIR Frequency Error Filter 
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The AGC is derived from the average signal magnitude using the scheme shown below The I and Q 
signals are then divided by the this AGC signal to help keep the average amplitude constant for the 
remainder of the processing. 



I(t) 

Q(t)— 



2 2 
I +Q 



IIR Slow 
— ) Decay 
LP Filter 



IIR Fast 
Attack 
LP Filter 



Gain 
^Control 
Signal 



decrSnT ^ ^"^^ USed dependin 9 on whether the signal is increasing in strength or 

^irrRrfi^'^'r'T, 1 , 6 " R s{a ¥f t wAh one dela V element - They have the same response as an 

S 9 , Pu J ^ Pe fi ' ter 15 US6ful f0r obtainin 9 a very low cutoff frequency with little processor 
overhead. It can be implemented by one line of code. ( y = k1*y + k2*x;) The down side is it's poor 
frequency response as can be seen in the following plots. 

in>-k2^r) Sl0W A out '">— K2^g) ^ | > out 

Lki-0J L k3 _gJ 



ys n := kl ys n _ , + k2 x 



yf :=k3 yf l+ k4 x n 



f:=0,5f..fs z(f):=e (>2 '" 
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Hfl>.):= — AKf):=20-loB(|HHz(0)|) 



As(f):= 20-log(|lI S (z(f))|) 



UR Frequency Response 




Frequency(Hz) 

The following code segment performs the AGC function: 

mag = sqrt (Samp .x*Samp . x + Samp . y*Samp . y) ; 
if{ mag > m_AGCave ) 

m_AGCave = (1 . 0-1 . 0/250 . 0) *m_AGCave + ( 1 . 0/250 . 0) *mag; 

else 

m_AGCave = (1 . 0-1 . 0/1000 . 0) *m_AGCave + ( 1 . 0/1000 . 0) *mag; 
if( m_AGCave >= 1.0 ) // divide signal by ave if not almost : 
( 

m_BitSignal.x /= m_AGCave; 
m_BitSignal.y /= m_AGCave; 
m_FreqSignal.x /= ra_AGCave; 
m_FreqSignal . y /= m_AGCave; 



3.9. Frequency Error Detection/Correction 

The first attempt at frequency control used a modified Costas loop PLL in order to achieve coherent 
signal detection 5 . The method worked for BPSK but required a very narrow bandwidth loop filter in order 
to lock on QPSK signals. This work was abandoned in favor of wideband frequency locking and non- 
coherent PSK detection. Any benefits that coherent detection would have gained would probably be lost 
in the HF environment. 

The AFC is now performed by calculating the slope of the frequency within the frequency error filter 
bandwidth and essentially moving the NCO center frequency so that the frequency peak(if one exists) is 
at the center frequency. The phase of the signal is the arctan(l(t)/Q(t)). Since frequency is the derivative 
of phase, the signal frequency is just the derivative of the arctan function. 
The following Mathcad simulation shows this relationship using two different frequency signals. 



F2 := 



Il(t) := cos(2-Jl Fl t) 
Ql(t):=sin(2-7fFIt) 
int(t):=Il(t)*Ql(t) 



I2(t) := cos(2-7t-F2 t) 
Q2(t):= sin(2JcF2t) 
in2(t):=I2(t) + Q2(t) 
Input Signals 1 and 2 
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I - 92 -° 



Phase of Signals I and 2 



tol(t) := ^-Sl(t) C02(t) :=^e2(t) 

dt dt 
Frequency of Signals 1 and 2 



I _.2(«)5 



S„ a n d nf^l C nH o ^J^"? i( ? en,ity WaS ° btained that 9 ives the si 9 nal fr *^ncy as a 

function of the I and Q signals and their denvatives without the use of the atan() function. 



dt \l(t)/ 



/Q(t)\ 2 d,I(t) 
\l(t)/ 



I(0^Q(t)-Q(t)£-I(t) 
Kt) 2 + Q(t) 2 



If the magnitude of the l/Q signal is adjusted to always be equal to one by the use of some AGC then tr 
denominator can be ignored. This allows the implementation of the frequency detector to be 
implemented using two differentiators as shown in the following: 
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The MR LP filter are the same type as used in the AGC section with different time constants. 
1IR Step Response I1R Frequency Response 



J 0.6- 




The transfer function for the differential frequency error block was obtained experimentally and is plotted 
below. It is interesting to note the dip in the function around +/- 16 Hz on the idle(180 deg. shifting) 
signal. This is due to the two frequency components of the idle signal. As long as the error signal doesn't 
change sign at this dip, the loop will still lock correctly at the center frequency. The slope of the transfer 
functions change due to the presence of noise. This is due to the AGC acting on the noise and reducing 
the actual signal level, which as was shown earlier, must remain relatively constant in order for this 
frequency detector scheme to work. The noise level plotted here is right at the threshold of signal 
detection, so is worst case. 
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Frequency Error vs Various Signal Conditions 




■60 -50 -40 -30 -20 




The differential frequency error was originally used to correct the master NCO by itself Now a secondarv 

& tlT ♦h rTOr Signa ' iS , US ? d ° nC t the eTOr 9etS Within 3 Hz - ™ e differential Muency error signal 
is used when the error signal is large, then the phase derived frequency error kicks in when the error 
De comes small. 

This secondary error signal is derived from the difference angle of the baseband signal (described later) 
This error signal is generated by how far from the ideal 0 or n phase position the signal is If the 
anVZZZr i 1 k h Ph T ^ fference of the PSK31 si S nal will be rotated from the ideal position and 
an error metnc can be denved. This method can only be used for a very narrow range of a few Hz and so 
is only used once the main frequency error has dropped within 3 Hz. "Tewnzanaso 

The following is a code snippet from the AFC control: 



tdefine FCROSS (.048) //freq wher< 

fdetine FEMIN ( FCROSS/FERRLPG) 
if( <ra_FerAve < -FEMIN) || (m_FerAve > FEI 
m_FerAve - (1 . 0-1 . 0/FERRLPK) *m_Fer 



if( (mJJFreqError > -FCROSS) 5S (m_FerAve <= 0.0) ) 

else m - FerAve = <1 • 0/FERRLPK) *m_FerAve + (1.0/ (FERRLPG*FERRLPK) ) *m_QFreqError; 

m FerAve = (1.0-1. 0/FERRLPK) *m_Fer 
if( (m_QFreqError < FCROSS) Si (m_FerAve : 

else : "- FerAve = U-O-l-O/FERRLPKl'mJTerAve + (1.0/ (FERRLPG*FERRLPKJ ) *m_QFreqErr 
m_FerAve = (1.0-1 .0/FERRLPK) *m_FerAve + 



(1.0/ ( FERRLPG * FERRLPK ) ) *fei 



1/ (FERRLPG'FERRLPK) ) *ferror 
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3.10. Symbol Synchronization 

The next level of synchronization is to find the center of each symbol in order to sample it at the optimum 
time. Several schemes were tried with varying success. The classic early-late synchronizer was tried that 
integrates the signal energy over part of the symbol time and then again with a small time delay. An error 
signal can be obtained that is fed back to adjust a symbol clock. This works but had problems with noisy 
QPSK signals. Another method was tried using an algorithm that selectively finds the peaks and valleys 
in each I and Q signal 7 . This method worked OK but was complicated. As a side benefit it could provide 
a good signal quality metric that worked well with very noisy signals. However, the final method chosen 
was a simple method that seems to work quickly and well is shown by the following diagram. 

Separate Filter For Each Sample 
Time Within a Sample Period 



y(Q) | — ° 



3 |LP Filter (1 ) ~| o 

LP Filter (2 ) ~| o 



;r(19) | o 




Symbol Period 
Counter and index 
Generator 



There are about 20 samples per symbol at the 612.5 Hz. Sample rate. The energy in the input signal at 
each sample time is individually filtered and stored in a filter array. At each symbol period of .032 
seconds, the filter that has the most energy is selected and the sample point associated with this sample 
is assumed to be the center of the data symbol. 

The LP filters are again the simple IIR's with the following characteristics: 



IIR Step Response 



IIR Frequency Response 
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The following is a segment from the bit sync routine: 



(sample. y'sample.y) ; 

D.O) *m_SyncAve[BitPos] + (1 .0/100 . 0) -energy; 
. / see if at middle of symbol 

Trigger = TRUK; 

P Doc->m_Sync:Hist|m_PkPos] . < INT) ( 10000 . 0* m _SyncAve [m_PkPos J > .- 



INT) CJ000.0»m_SyncAve[BitPos]); 
// halfwa 



3. 1 1. Squelch Function 

To implement a squelch function, a measure of signal quality is needed. It was hoped that by using some 
statical measures on the symbol sample bin energies, that a squelch function could be derived 
Unfortunately, the distribution on these bins does not change very much until the signal gets very' noisy 

SL!f nf P t , '^T ° f Si9nal qUaNty - The answer ™ b V l00ki "9 1 at the vector 

display of Peter's PSKsbw program. Visually one can see how good the signal is by noticing how the 
incoming signal vectors are distributed. A strong signal has most of the vectors tightly distributed around 
the perpendicular ax.s. A no.sy signal will have a wide distribution around the axis. By creating a 
5! ?.?,,? e „' n Ti n i Signal angles and lookin 9 at the avera 9 e deviation that they take from the 
ideal 0, 90, -90, and 180 positions, one can extract a signal proportional to signal quality In order to 
use the same algorithm for BPSK and QPSK. only the vectors around 0 and 180 degrees are anafyzed 
' t ^'" comin g J'gnal d.fference angle must be found. The direct approach would be to just take the 
arctan(Q/l) and subtract the previous arctan(Q/l) from it to get the difference angle. 



Kt) > 

Q(t)— ) 



atan(Q/l) 



Symbol 

Time 

Delay 



Difference 
Angle 



12/28/99 Moe Wheatley, AE4JY 



Copied rn m 106991 // on ()?/! : ?()()/ 



This method has a couple of problems. One is that if l(t) is zero things blow up. Second, there must be 
extra logic to do the subtraction since the atan function doesn't return a nice 0 to 360 degree range. One 
must keep track of which quadrant the vectors are in and subtract accordingly. 
A different approach is used that creates a third vector whose angle is the difference angle but does not 
use the atan function. This vector is created geometrically and so does not suffer from the discontinuities 
of the transcendental atanQ function. The atan function can now be used on this new vector to find ifs 
angle directly. The atan function can still blow up but only if both I and Q are zero which is less likely to 
occur. 

To create this third difference vector one simply multiplies the current sample l,Q vector by the complex 
conjugate of the previous l,Q vector. 







= Vector whose angle is the difference between the original vectors. 



In terms of the complex coordinates I and Q, 
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In diagram form 



this implementation is show below. 





Q(t) 




(z>— 

+ 

^ 1 




atan( ) 



^Difference 
'Angle 



In actuality, a 'C function atan2(y,x) is used to obtain the angle. It returns a value from -PI to +PI This 

XSHSTXil °Z of *fT t U hiSt ° 9ram ° f an9leS around 0 and ?™Sd PI 0 and 
ISOdeg). If one swaps I and Q, the affect is to map 0 and 180 degrees to +/- 90 deqrees This translate 
the two ranges of interest( -P./4 to PI/4 and 3PI/4 to 5PI/4), into the new ranged -of Z?to 3pS.7nd 5R/4 





Pi/4 to 3Pi/4 5Pi/4 to 7Pi/4 

(range around (range around 
180 deg.) Odeg.) 
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The basic signal quality scheme is then implemented as follows: 




If the incoming difference angle is in the "ZERO" range, the absolute value of the angle and Pi/2 is found. 
If the incoming difference angle is in the "180 degree" range, then the absolute value of the angle and 
3Pi/2 is found. The two values are added together and run through a low pass filter. This is a measure of 
how far away from the ideal the angle is, and is used for squelch control and signal quality display. 

Peter Martinez specified a feature into the PSK31 signal scheme in which each transmission should begin 
with a string of at least 32 consecutive 180 degree shifting "idle" symbols and also each transmission 
should end with a string of 32 consecutive 0 degree non-shifting symbolsfsteady carrier). The reason 
was twofold. The beginning idle string gives the decoders a chance at synchronization before any data is 
sent. It can also be used for squelch functions to indicate a transmission is starting. The trailing carrier 
can also be used to deactivate the squelch since a string of 32 "0 degree" symbols cannot occur during 
any data transmission. 

In WinPSK two counters are used to count consecutive idle characters and solid carrier symbols. If either 
reaches it's limit, it bypasses the normal slow acting signal quality signal and forces the squelch either on 
or off. This gives a much quicker acting squelch under good signal conditions. 
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Signal Quality vs SNR 




Note how the QPSK signal with very high SNR still has a lower quality signal. This is due to ISI from the 
bit filter giving the signal some phase jitter. 

This block is also used to derive the secondary frequency error signal that is used along with the main 
differential frequency error generator to lock onto the center frequency. The error signal is taken from the 
signal qualrty block before the absolute value function and IIR filter. This signal then has the sfgn and 
magnitude information that can be used to nudge the main mixer NCO toward the true center frequency 
This error signal kicks in when the overall frequency error is less than 3 Hz. "«Muency. 
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The following is the basic squelch/signal quality function: 

i£( (angle >= PHZ_180_MIN) ss (angle <= PHZ_180_MAX) 

i f ( (pDoc->m_pSettings->m_ModType == QPSK_MODE) 
temp = PI2/4.0 - angle; 



m_QFreqError = ( 1 . 0/ 12 . 5) "temp; . //normalize same as freq e 

i£( temp < m_DevAvel 

m_DevAve= ( 1 . 0- 1 .0/50 . 0 ] <m_DevAve + (1 . 0/ (50 . 0) ) * temp; 

else 

m_Of£CouiTt = 0; 

if(m_QnCount > 20 ) // fast squelch counter 

m_DevAve = 0.1; 

m_OnCount++; 



cings->m_UseLSB) ) 
3 freq error 



n_DevAve= (1.0-1.0/50.0) *m_DevAve 
n_DevAve= (1.0-1. 0/100. 0) *ra_DevAve 



lelchOn - FALSE; 
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3. 12. Symbol Decoding 

^l^c T h S «° C0 " Vert ' , G ' 3nd Q Signals back int0 the four P° ssible symbols (two for BPSK) One 
could use the difference angle as described in the squelch section and find the nearest 0 90 90 or 180 
degree pos.hon and that would be the symbol to use in the decoder. This is actuall^mplem en red in 
WmPSK and used if the user de-selects the following experimental symbol decoder ' mp ' emented In 

^h?,L eStin9 b ff W3S Written , by Y u ri ° kUnev that describes an algorithmic method of decoding 

rirrco^p— 

Let all possible original transmitted signals over k-1 symbol times be: 

5,(0 - tf,(/)cos(fl») + SQXt)sm(<p) Where i = 0,2,3...k-1 and <p is an initial unknown phase. 
The received signal is .r(/) = S,(t) + «(/) where n(t) is White Gaussian Noise. 

toSjUSCS? j""' be minimum if the receiver picks ,he Signal Si(t) such 0131 * e fol,owin 9 inec i ualit y 

[ ]*(0S/,.(0<ft] + [' pW(/)df] >[ < Tx(/)57//)rf/]\j^7i)5e > (/M/] 2 
SSI?^ ^ '°° k 31 ^ iS t0 maXimUm Va ' Ue f ° r a " p0Ssibilities of S M of the left side <»f the 



S s e S[s al ( S k-i a )T be SPNt int ° SeParate inte9ra ' S f ° r 6aCh Symb °' time Spanned ^ *» P rocess whi <* in 



v,= J x(t)SI(t) j dt + jx(t)SI(t) J dt+ fx(t)SI(t).dt\ 

L(»-*-»r <-* )r {n . l)T J 

+ J x(/)Sfi(/) y df+ jx(t)SQXt)jdt+ \x{t)SQ(t).dt\ 



Now SI It) = cos(w/ + A^.) the projection of S(t) on the real axis and SQ,(t) = sin( wt + A6 ) 
the projection of S(t) on the imaginary axis where A9 is the phase information for that symbol period. 
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= I .r(Ocos(H'/ + A0 ( „_ i+ , ) )<//+ J x(/)cos(w/ + A0 ( „_, +2) )<ft + J;c(/)cos(w/ + A0„)a7 

|_<»-*-l)J (""W («-!)?■ J 

J jt(/)sin(w/ + A6(^ lt+1) )^+ Jj(Osin(vW+A0 (n . ;t+2) )rf/+ Jx(/)sin(w/ + A0„)rf/ 



For BPSK signals, A6 can be 0 or jt. 

For QPSK signals, AG can be 0, 7t/2, -nl2, or n 



3.12.1.1. Maximum Likelihood Detector 
For example let's look at a BPSK signal over 3 symbol times(k=0,1 ,2). The table shows all possible 
signal variants. 



(n-2) signal phase 


(n-1) signal phase 


(n) signal phase 


AG at sample n 


tot 


cflt+0 


cot+0 


0 


cat 


cot+0 


COt+7l 


71 


cot 


fflt+lt 


cot+0 


7C 


cot 


C0t+7t 


C0t+7C 


0 



Using trig identities and substituting all possible AG's for a signal over k-1 symbol times, one can 
represent the above in terms of the l(t) and Q(t) signals. 



sin(atf + n) = - sin(cwr) 



cos(fi>f + n) - - cos(cur) 



so substituting all possible angle variants: 

v, = I ~f x(t)cos(wt)dt + Jx(Ocos(wf + 0)dt + Jx(/)cos(w« + 0)dt 



+ J x(l)&ia(wt)di + \x(t)sin(wt + 0)dt + jx(f)sin(>f + 0)dt 
v 2 = J x(t)cos(wt)dt + Ix(t)co&(.wt + 0)dt-]x{t)cos(wt)dt 



+ j x(l)sia(wt)dt + jx(t)sm(wt + 0)dt- j x(t)sm(wt)dt 

l(n-l)T (-.-2)7 (n-l)7 

and so on for all 4 variants. 
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Using the I and Q notation for the integrals one can simplify the equation sets. 
/„= j x(t)cos(ot + A9)dt Q n = jx(t)sm(cot + Ae)dt 



"1 = k- 2 + + I„f + [e„- 2 + 2„-i + a]' to A9 = 0 at sample time n 

v 2 = Un-2 + '„-■ - /. f + [a.j + 0.-1 - a]' for A6 = r at sample time n 

v 3 = [ ',-2 ~ + K ] 2 + [0.-2 " 0„-, + 0„f for AG = k at sample time n 

v< = [/„. 2 - /.., -/„]* + [fi„_ 2 - Q nA -Q a f for AG = 0 at sample time n 

riceiledT^bd." 1 ° f V1 10 V4 hi9heSt probabilit y A9 corresponding to that variant is used for ine 

WinPSK looks at all possible variants over4 symbol periods. This involves looking at a total of 16 
variants to make a deasion on the BPSK symbol. The new symbol is shifted into a shift register and 
when an inter-character sequence of two consecutive zeros is received, the complete character is 
decoded back to ASCII by using a "reverse Varicode" lookup table. 





4 Symbol 

Period 

Maximum 

Likelyhood 

Calculator 

block 


4 


10 11 12 13 K 


^Elr^iElu^Ll] — * 

Symbol Period ' } 

Delay Elements . 




BPSK 1 
Output 
Bit 


QO Q1 Q2 Q3 ^ 





Varicode Decode 


^Decoded 


Look-up Table 


^Character 


TTTT-T 




Shift Register/logic 
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The following is the code to extract the BPSK symbol from all 64 combinations of 4 symbols worth of 
data: 



13 ( 



_I3 - 



puw (m_Q3 

(ra_I3 + m_12 + m 
pow(m_Q3 - m_Q2 

pow(ra_03~- m 02 

max ) ( max ■■= v 
(m_I3 + m_I2 - m 
pow(m_Q3 + m_Q2 

max ) { max = v 

po~~w<m_Q3~+ m_Q2 
x ) ( max - v 

. (m_Q3 ~- m_Q2 ~- m_Q] 

_I3 - m_I2 - m_Il - n 
pow(m_Q3 - m_Q2 - m_Q} 
max ) ( max = v; symb 

t m_I2 + m_Il - r, 
pow (m_Q3 + m_Q2 ~ 



pow(ra_Q3 - m_( 
13 - m_I2 + 



pow (m_Q3 - m_Q2 - 
3 - m_I2 - m_ 



M_N0CHANGE, 
QO, 2 ) ; 



// v[4] 0 deg. 
// v[5] 0 deg. 



// v[ll] 
// v[12] 
// v[13J 

// vim 

// v[15) 
// v[16] 



80 deg 
80 deg 



3.12.2.1. Maximum Likelihood example 

For another example let's look at a QPSK signal over 2 symbol times(k=0,1). The table shows all 
possible signal variants. 



(n-1 ) signal phase 


(n) signal phase 


A8 at sample n 


cot 


cot+0 


0 


cot 


COt +71/2 


+7C/2 


cot 


COt-7C/2 


-71/2 


cot 


C0t+7t 





One can use the following trig identities to find all variants with respect to I and Q. 
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sin(cot + n) = -s'm((Qt) 
sin(a» + -^) = cos(a») 



cos(ca + n) = -cos(cot) 



v 2 =[/„-, +&f +[&,-■ "/J 

v 4 =[/„., -/„] 2 +[fi,-.-a,f 



for A8 = Oat sample time n 
for A6 = +W2 at sample time n 
for AG = -7t/2 at sample time n 
for AG = n at sample time n 

SSled TymbT 1 ° f ^ * ^ ^ ^ h ®"* A ° corres P ondin 9 to »»at variant is used for the 

WinPSK looks at all possible variants over 4 symbol periods. This involves looking at a total of 64 
vanants Because the next step is to decode the incoming QPSK symbols using a soft Viterbi decoder 
the max,mum of each symbol possibility^. + 90, -90. and 180 deg.) fs calculated normalized s St the 
max,mum ,s always equal to one and -log() is taken to get a probability measure that the vlerbi decker 
can use as a metnc in deciding the bes t decoding pa th as described next. "ecoaer 

>SYM_00 likelyhood 
►SYM_01 likelyhood 
>SYM_10 likelyhood 
>SYM_11 likelyhood 




3.12.2.2. Soft Viterbi Decoder 

The Viterbi Decoder tries to reconstruct the original transmitted signal by looking at the sequence of 
received signals and comparing it to all the possible transmitted sequences. The sequen^for Jth) that 
has the best match is chosen to provide the best guess at a current data bit 
The process is similar to being lost in your car and you try to find out where you are on a map bv 
observing how far you have traveled, which way you have turned, etc. By looking at all possible roads 
on the map with the same turns and distances that you've taken, you pick the roads that best match vour 
route and conclude where you are. Viterbi added a simplifying step where if two paths end up afSe 
same intersection, you p.ck the path with the best match at that point, and eliminate the other The paths 
that are left(surv.vors) are the only ones left in contention for future calculations 
The details of the Viterbi algorithm can be found in most communications books 9 so it wont be discussed 
in much detail hera Reference 10 dedicates an entire chapter on this algorithm and referent prSdS a 
good step by step description of the algorithm with lots of trellis drawings. provides a 



12/28/99 Moe Wheatley, AE4JY 



Copied from 106991/7 on 02/11/2007 



WinPSK uses a modified implementation of a Viterbi decoder described in an article by Peter Martinez. 
Soft decision capability was added and the path metrics converted from integer to floating point 
representation. 

Recall how the transmitted data is encoded using the following state machine. Note that 4 memory stages 
plus the current data bit is used to form the output symbol. Every input bit causes the state machine to 
transition to one of 16 possible states. The 2 bit output symbol is derived from the state and the input bit. 



2 bit 

Symbol ^ s0 



-\x4 

<"|S0 1 1 1 11 1 



The Viterbi decoder uses the same encoder to try all possible transmitted combinations and calculate an 

error metric based on how "far away" from each possibility the received symbol is. 

The algorithm is executed at every symbol period with the most likely symbol metric contained in an 

array( m_QPSKprob[ ] ) whose indexed elements contain the value calculated by the symbol decoder. 

These values are zero for the most likely symbol and a positive value for the other three symbols 

depending on how confident the symbol decoder was in the decision. 

For example: 

m_QPSKprob[ SYMB_00] = metric for SYMB_00 =0.3 

m_QPSKprob[SYMB_01] = metric for SYMB_01 =0.0 (SYMB_01 is most likely symbol) 
m_QPSKprob[ SYMB_1 0] = metric for SYMB_10 = 0.4 (SYMBjl 0 is least likely symbol) 

m_QPSKprob[SYMB_11] = metric for SYMBJ 1 =0.1 

The algorithm begins by filling two temporary 32 element arrays in the following manner. 

The index into the arrays is all possibilities that the constraint length 5 encoder can have.(2 5 =32) 

For each possible index, a new path distance is computed by adding the existing survivor path distance 

and the new symbol's error distance. A second array (bitestimatesQ) gets the new bit pattern estimate 

from the existing survivor paths plus the new bit combination being examined. The minimum of all the 

path distances is kept for use later normalization. 

for(i = 0; i < 32; i++) // calculate all possible distances 

{ //lsb of ? is newest bit estimate 

pathdist[i] = m_SurvivorStates[i / 2].Pathdistance + m_QPSKprob[ ConvolutionCodeTablefj] ]; 

if(pathdist[i] < min) // keep track of minimum distance 
min = pathdistfj]; 

// shift in newest bit estimates 

bitestimatesp] = ((m_SurvivorStates[i / 2].BitEstimates) « 1) + (i & 1); 



The next step is to index through all the path distances and eliminate all the paths that reached the same 
state but have higher path distances. The shortest path is copied into the survivor state array along with 
the associated bit pattern estimate. Note also that the minimum value calculated earlier is subtracted 
from the total path distance before being saved into the survivor array. This keeps the value from 
growing over time and remain bounded. 



12/28/99 Moe Wheatley, AE4JY 



Copied from 106991/7 on 02/11/2007 



for(i - 0; i < 16; i++) //compare path lengths with the same end state 

j '/ an d keep only the smallest path in m_SurvivorStatesQ. 

if(pathdistp] < pathdist[16 + i]) 



m_SurvivorStates[i].Pathdistance = pathdist[i] - min 
m_SurvivorStates[i].BitEstimates = bitestimatesp]; 



m_SurvivorStates[i].Pathdistance = pathdist[16 + i] - min 
m_SurvivorStatesfi].BitEstimates = bitestimates[16 + i]; 



} 



Finally the survivor state array bit estimates are examined 20 bits back in time, and the majority of ones or 
zeros is used as the best estimate for the transmitted bit. If there is a tie, a fair "coin toss" is usee I to 
guess at the bit. It has been shown that calculating over 4 or 5 times the constSlerJh doe nS 
significantly improve performance of the Viterbi decoder. 

ones = 0; 

for(i = 0; i < 16; i++) // find if more ones than zeros at bit 20 position 

ones += (m_SurvivorStates[i].BitEstimates&(1L « 20)V 
if(ones==(8L«20)) V "' 

return ( rand() & 0x1 000 ); //if a tie then guess 

else 

returnfpnes > (8L « 20) ); //else return most likely bit value 
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3.13. Display Signals 

WinPSK provides several views of the received signal for use in tuning or for determining the quality. All 
the display signals are calculated separately from the main receiver using a single worker thread to 
process and plot the data. A single "TAB" control allows switching between several different types of 
views using the same screen area. 

3.13.1. FFT for Spectrum Display 

A 2048 point FFT is implemented to obtain frequency information from the received signal. A slightly 
modified version of Takuya OOURA's original radix 4 FFT software package was used to perform this 
function. Log output and a Hanning windowing function was added. The data for the FFT is obtained 
prior to down mixing so the frequency points do not move when changing receive frequency. The FFT 
data sample rate is 5512..5 Hz so the frequency range is 0 - 2756 Hz with a resolution of about 2.7 Hz. 
The decimation filter starts to roll off at about 2200 Hz so the display is limited from 100 to 2200 Hz. 
Display zooming is performed by scaling the index into the FFTs output not by changing FFT size or 
sample rate. 

A standard amplitude vs frequency plot is available as well as a "Waterfall" type in which the amplitude is 
displayed as intensity/color, the x axis is frequency, and the v axis is time. 

3.13.2. Vector Display 

A vector display similar to the one used in "PSKsbw" shows the incoming signal phase. It is overlayed on 
top of the main spectral views. Because WinPSK works on blocks of symbols at a time, the display 
throws up several vectors at once rather than one at a time. Also the amplitude of the vectors is constant 
rather than changing with input signal amplitude. 

This display is useful for seeing if a signal is BPSK( vertical vectors only) or QPSK(vertical and horizontal 
vectors). It is also a good test to see if a signal is tuned exactly on frequency. WinPSK's AFC can be 
fooled and so manual frequency adjustment may be required. Tweaking the frequency for true vertical an 
horizontal vectors will assure frequency correctness if the AFC has problems on noisy or distorted 
signals. 

3.13.3. Input Signal 

The Input signal view is just a scope-like view of the raw input signal prior to down mixing. If s main use is 
to adjust the soundcard input level to keep from overloading the A/D. The trace color will change from 
green to red on peaks that get within 3dB of the limit. 

3.13.4. Sync histogram 

The sync display graphically shows the energy histogram of the bit sync routine. The center of the bit 
should be at the peak of this display and is shown as an extended line to distinguish it. This display is 
interesting in that it can be used to show if the received signal bit rate is off frequency (Assuming your 
own soundcard is on frequency). 

If the bit center drifts noticeably, say 1 5 seconds to drift across the screen, then either yours or the 
sending soundcard is off frequency by an amount greater than can be explained by simple crystal 
inaccuracies. From observing several on the air signals it appears that the sample rates on some 
soundcards is actually 1 1000 or maybe 1 1050 Hz instead of the standard 1 1025. My laptop suffers from 
this problem and uses 1 1050 Hz instead of 1 1025 even though the driver is set to the correct frequency. 
The WinPSK bit sync algorithm will track this off frequency signal but with a phase lag due to the filtering 
and so the center of the bit is off causing receiver degradation on noisy signals. 
As a countermeasure for this, a clock adjustment offset can be set in WinPSK to compensate for off 
frequency soundcards. The value is in ppm(parts per million) and a rough estimate of the clock error is 
calculated while receiving a strong signal and displayed in the program status bar. The problem with this 
is knowing if it is your soundcard or the other guy's that is off. 
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4. Windows Program Implementation 



4.1. PC/Windows Implementation Issues 

The implementation language C/C++ and Microsoft Visual C++ tools were chosen due to existing 
familiarity and experience. The only real decision to make was whether to program the Windows 
application ustag MFC(M<crosoft Foundation Classes) or straight API (Application Programming Interface 
) coding MFC can Provide a lot of ftjnctoonality with little code but is much harder to customize and you 
have to deal w.th the quirky MFC code. Using the API allows the programmer much more control over 
what is going on but requires a lot of tedious coding to implement even the most basic functionality 

In the end, laziness overcame common sense and the MFC route was undertaken. Looking back 
I m still not sure it was the nght decision. Perhaps when the hair grows back that I pulled out while 
dealing with those inane classes, a more objective opinion will ensue. 

4.2. Real Time Considerations 

The first issue to resolve was how much processing could be done using a PC and Windows 
Obviously the basic DSP functions could be done since Peter's and other programs run fine under 
Windows. One goal was to provide bigger signal displays with more resolution and range. Could this be 
achieved in addition to all the underlying signal processing that had to occur? 

Windows is NOT a real time operating system. The response time to any event such as mouse 
key, soundcard, or any other event, is not bounded or specified. It is within "spec" if your mouse 
hesitates a few hundred milliseconds while your modem disconnects. This at first would seem to be a 
death sentence for a signal processing application that must process thousands of samples per second 
without missing a beat. The key to resolving this issue is buffering, lots and lots of buffering As lonq as 
the average processing time your program gets is more than the average amount of time required to 
perform a task, buffering can be used to fill in the times while the processor is away doing other things 

There still is no guarantee that your application will not starve. Screen savers are probably the 
most ill behaved applications around in this regard. Some CD rom device drivers and floppy disk drivers 
can also consume way too much processor time. The best you can do is try to adjust your buffering to 
nde through most of the processor interruptions. A couple of seconds worth of buffering is probably not 
out of line. A couple hundred milliseconds may work if no other app is running and you don't move the 
mouse around much or access the disk. Windows 2000 promises to allow more control over the 
multitasking quantum times but don't hold your breath. 

One feature added to Windows starting with Win95, was multitasking. This allows the 
programmer to split up tasks into independent sections of code as if a separate processor was beinq 
used to execute that task.(With a multiprocessor NT system that can actually be the case) One of the 
biggest conceptual hurdles in moving from embedded or DOS applications to Windows applications is 
dealing with the fact that your program is essentially called by the operating system based on messaqes 
sent to it by the mouse, keyboard, etc. The use of multitasking threads allows one to write code that does 
not rely on Windows message processing which is slow and unpredictable with respect to response time 

WinPSK uses two worker threads. One is used to read/write soundcard data and perform all the' 
DSP functions to receive and transmit PSK31 signals. The second thread is used to display signal 
information graphically. This leaves the main process thread of the program to do the normal mouse 
keyboard, windows stuff. 



4.3. Float vs. Integer Implementation 

Another issue to decide is whether floating point math could be used instead of integer math for the DSP 
algorithms. Floating point math makes the programming part much easier since overflow and underflow 
issues diminish as well as built in functions to calculate sines and tangents and stuff are available The 
question is whether there is enough horsepower in a Pentium class processor to do such things A 
simple test program was written to compare the processing time of doing a simple DSP function a FIR 
filter. One thousand samples were run through a one thousand tap FIR using 32 bit integer 16 bit 
integer, single precision floating point, and double precision floating point math. 
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// code segment for timing FIR function 

((define TESTBUFSIZE .1000 

typedef short TEST_TYPE; 

//typedef int TEST_TYPE; . 

//typedef float TEST_TYPE; 

//typedef double TEST_TYPE; 

TESTJTYPE* pBufl; 

TEST_TYPE* pBuf2; 

TESTJTYPE* pBuf3; 

INT i,j; 

TEST_TYPE acc; 

acc = (TEST_TYPE) 0; 

for(i=0; i<TESTBUFSIZE; i++) 

( 

for(j=0; j<TESTBUFSIZE; j++) 
( 

acc = acc + P Bufl[i] * pBuf2[j]; 

} 

P Buf3[i] = acc; 



The following table presents the results on several different processors and operating systems. 



Data Type 


133 Pentium Win95 


400 Pentium II Win98 


500 Pentium III NT4.0 


16 bit integer 


1 14800 uSec 


36200 uSec 


29000 uSec 


32 bit integer 


99400 uSec 


9100 uSec 


7280 uSec 


Float 


45800 uSec 


8850 uSec 


7070 uSec 


Double float 


69500 uSec 


17500 uSec 


7110 uSec 



From this it was found that single precision floating point is actually faster than integer arithmetic. This 
sort of makes sense because a separate floating point unit runs in parallel with the main CPU. 16 bit 
integer arithmetic is much slower than 32 bit probably because the native word size in the Pentium is 32 
bits. WinPSK uses double precision floating math since there is not that big a hit in performance and 
most of the library math functions use doubles for arguments. 



4.4. PC Soundcard Settings 

The PC soundcard can be set to various modes of operation. For WinPSK, the fundamental sample rate 
was chosen to be 1 1025 Hz. This is a common sample rate that must be supported by any PC 
soundcard running with Windows. Unfortunately, many soundcards do not implement the 11025 Hz 
sample rate correctly and round it off to 11000 or 11050 Hz as discussed earlier. 

The 16 bit mode is used for all soundcard I/O. This is the number of D/A, A/D resolution bits and should 
not be confused with the BUS width of the soundcard which could be 8, 16, 32 bits depending on the 
type. Also the single channel, mono, mode is used rather than stereo. 
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4.5. Program Structure 



4.5.1. Hierarchy Diagram 



The following diagram shows the major C++ classes that form the WinPSK program The base classes 
are shown for those derived from Microsoft's Foundation Classes. The diiSJSWlJSSlS 



are executed using separate worker threads. 




Worker Thread #1 
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4.5.2. Class Descriptions 

In order to become familiar with Windows programming, one needs to surround yourself with piles of 
Windows programming books, use the online help reference, and download as many examples of 
programs as you can. CodeGuru(CodeGuru.com) is one place that has lots of code examples for MFC. 
Several others exist as well. There are several newsgroups that cater to Windows programming as well. 
Below is a brief description of the major code blocks. The details of the code are not described here but 
the code is fairly well documented for someone who is familiar with C++, Windows, and MFC. The code 
is not textbook stuff so don't think this is the best implementation. This author is still struggling with the 
basics of all this. 

CWinPSKApp 

Main Entry point for the program. Initializes the basic window's document-view classes and handles the 
case of multiple program instances. 

CMainFrame 

Class provides the outer window frame with system and menu controls. It saves and restores the 
program screen position and size settings to the registry. Screen setting men..; and general program 
setup dialogs are handled here as well as status bar functionality. 

CDemoDIg 

This dialog class is used to select a demonstration mode that does not use the soundcard but generates 
a repeating character string and then decodes it. This useful for playing with the various program 
features off line. A built in noise generator simulates AWGN with selectable amounts. 

CSetDIg 

This dialog class is used to enter some general user setup information. 
CWinPSKDoc 

This class is used primarily to store all sorts of program variables that need to be accessed from other 
classes and threads. This is one of the few MFC's that are thread safe. Any class that has a Windows 
message handler cannot be called by any thread except the main process thread. 

CSettings 

This class is just a structure that holds all the user program settings that need to be saved and restored to 
disk. There are some class methods that save and restore the class to a file "Settings.dat" that is in the 
program's path. 

CSplitterWnd 

This MFC class is used to provide three separate view areas on the program screen. The top view area 
is for the received text, the middle area is for the transmitted text, and the bottom area holds the program 
controls, data tuning plots, and macro controls. 

CRcvView 

A class to create and place the receive text control box in the top splitter window. 
CRcvEditCtrl 

A CRichEditCtrl derived class that is a read only edit box where all the received text is placed. It handles 
word wrap, color keying, drag and drop, clipboard operations, etc. 

CXmitView 

A class to create and place the transmit text control box in the middle splitter window. 
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CXmitEditCtrl 
CWinPSKView 

te» classes thai ma separate worker ttraaa r fai* S « SSXSXn ^ 855 J 

CMacrodlg 

CSquelchCtrl 

CTheirCallCtrl 

CPIotData 

This class spawns a worker thread which is used to uDdate the data view cmo 

S=a™^ 

cm 

ClOCtrl 

This class spawns a worker thread which instantiates either thP pc;^ How« >. , . 
depending on the state of the Transmit/Recife button and o^SLf^ZZ M °^ on class 
A separate class, CSound is used to talk to the soundcard E 1 S ^ accordin 9'V- 

serial port is used to provide a simple PTT function f ° 3 ^P' 6 of pins on a 

CSound 
CPSKMod 
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CPSKDet 

The class that takes the PSK31 audio samples from the soundcard and decodes it into a text stream. 



4.6. Miscellaneous Software issues 

4.6.1. FIR Filter implementation 

All the FIR type filters use the same structure. An array containing the FIR coefficients and a circular 
buffer structure with two pointers is used. New data samples are entered by decrementing the inptr and 
placing the new sample into the FirQue at that position. Next the FIRptr is set to the new inptr position 
and then the following is performed N times. For each coefficient pointed to by Kptr, the value pointed to 
by FIRptr is multiplied together and added to an accumulator variable. After each MAC operation, both 
pointers are incremented. Logic is added to deal with the circular buffer wrap around at the N-1 position 
in the FirQue. 



CoefTable 



Kptr — ) 



K(N-1) 




The following code segment shows the general method of implementation. For a decimation type FIR 
filter, the MAC loop only needs to be performed at the new sample rate. For complex data, the MAC has 
to be performed on both real and complex parts of the data. 

for( i = 0; i<BlockSize; i++ ) // process Blocksize new samples 

^ if( -Inptr < FirQue ) //deal with wraparound 
Inptr = FirQue +N-1; 
*lnptr = pln[i]; // enter new sample into Que 

acc = 0.0; 
Firptr = Inptr, 
Kptr = CoefTable; 

while( Kptr < (CoefTable + N) ) //do the MAC's 

( acc += ( ( *Firptr++ )*( *Kptr++ ) ); 

if( Firptr >= FirQue +N ) //deal with wraparound 
Firptr = FirQue; 

pOut[j++] = acc; //save output sample 
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4.6.2. Inter-Class Communication 

A constant aggravation with Windows MFC programming is trying to communicate or access methods 
between the various classes. It is easy to communicate into classes that were instantiated by the callinq 
class. Problems arise when trying to get information back through to the parents of the class For 
example the main view class can create and access classes that it creates but it is more difficult for the 
"children" to talk back to their parents.(probably some deep metaphor here..) The best way is to trv to 
structure your program to minimize the need for back access to classes. If needed one way to qet 
around this is to pass a pointer to the parent class on to the child class as a parameter during creation 
Sometimes this must be cast as a void* then recast back to the parent class to get around some 
problems in the order in which include files are processed by the compiler. 

Another communication problem is calling MFC class methods from worker threads. Most MFC classes 
are not thread safe and will crash. Writing or reading text in an edit box from a worker thread will not 
work. One way around this is to send a windows message from the worker class to the Windows class 
This method is used to transport character data to and from the DSP worker thread to the Window's edit 
boxes for display. The Windows messaging system can have long response times and should not be 
used for any high speed data traffic. 

4.6.3. Processor Loading 

The processor loading was measured using several methods. Windows NT has a performance monitor 
that can show user and kernel processing loads for various processes. Visual Studio also has a code 
profiler that can be used but is klunky and doesnt work very well. 

Most measurements were taken using a system function that returns a 64 bit Pentium timer value that 
can be used to Timestamp" any place in code. 

All the timing values here and commented in the code, are based on a 133 MHz Pentium The followinq 
table gives some average execution times of some major processing blocks. 



Function 


Ave time 


Rate function is 
Called 


CPU Percentage 


Decimate by 2 FIR 


6 mS 


2.69 Hz 


1.6% 


Total PSK Detection Function(BPSK) 


12.5 mSec 


2.69 Hz 


3.4% 


Total PSK Detection FunctionfQPSK) 


16.7mSec 


2.69 Hz 


4.5% 


Decimate by 3 #1 


10 uSec 


18375 Hz 


1.8% 


Decimate by 3 #2 


13 uSec 


612.5 Hz 


.8% 


CalcBitFilter 


25 uSec 


612.5 Hz 


1.5% 


CalcAGC 


9uSec 


612.5 Hz 


.5% 


CalcFreqError 


10 uSec 


612.5 Hz 


.6% 


SymbSync 


8uSec 


612.5 Hz 


.48% 


DecodeSymb(BPSK) 


69 uSec 


31.25 Hz 


.2% 


DecodeSymb(QPSK) 


522 uSec 


31.25 Hz 


1.6% 










Plot Spectrum 


22 mSec 


2.69 Hz 


5.9% 


Plot Waterfall 


20 mSec 


2.69 Hz 


5.4% 


Plot Input 


8 mSec 


2.69 Hz 


2.1% 


Plot Sync 


4.8 mSec 


2.69 Hz 


1.3% 










Transmit Modulation(BPSK) 


15mSec 


2.69 Hz 


4% 


Transmit Modulation(QPSK) 


15 mSec 


2.69 Hz 


4% 



A rough CPU % load value is displayed in the status bar of WinPSK for relative comparisons of different 
CPUs. It does not take into account the main process thread or any system level processing times. 
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Problems/Bugs/Issues 

1 . The receive Window scrolling functions leaves a lot to be desired. The current line many times 
goes out of the scroll view. This requires clicking on the scroll bar to get it back inside the view 
window. More hair pulling with the RichEditCtrl is needed. 

2. The colors of the receive and transmit text sometimes get mixed up. See above comment. 

3. The maximum likelihood detector does not appear to give any noticeable improvement in signal 
reception. The added complexity is probably not justified. 

4. This program was written to experiment with PSK31 reception techniques and so is not a feature 
rich application. I have no plans to add much to it except maybe some bug fixes and minor 
tweaks. 



12/28/99 Moe Wheatley, AE4JY 46 

Copied fn m IO(i )91 // on Q? } 1 : ?()()/ 



